SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT: Masure, H. Robert
Rosenow, Carsten I.
Tuomanen, Elaine
Wizemann, Theresa M.

(ii) TITLE OF INVENTION: CHOLINE BINDING PROTEINS FOR
ANTI-PNEUMOCOCCAL VACCINES

(iii) NUMBER OF SEQUENCES: 25

(iv) CORRESPONDENCE ADDRESS:

(A) ADDRESSEE: David A. Jackson, Esq. 

(B) STREET: 411 Hackensack Ave, Continental Plaza, 4th
Floor

(C) CITY: Hackensack

(D) STATE: New Jersey

(E) COUNTRY: USA

(F) ZIP: 07601 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30

(vi) CURRENT APPLICATION DATA:

(A) APPLICATION NUMBER:

(B) FILING DATE:

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Jackson Esq., David A.

(B) REGISTRATION NUMBER: 26,742

(C) REFERENCE/DOCKET NUMBER: 600-1-158

(ix) TELECOMMUNICATION INFORMATION:

(A) TELEPHONE: 201-487-5800

(B) TELEFAX: 201-343-1684



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: CBP112

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Xaa Glu Asn Glu Gly Ser Thr Gln Ala Ala Thr Ser Ser Asn Met Ala
1 5 10 15

Lys Thr Glu His Arg Lys Ala Ala Lys Gln Val Val Asp Glu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(A) DESCRIPTION: CBP90

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Ala Arg Glu Phe Ser Leu Glu Lys Thr Arg
1 5 10



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(A) DESCRIPTION: CBP84

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Xaa Arg Glu Phe Ser Leu Glu Lys Thr Arg Asn Ile Gly Ile Met Ala
1 5 10 15

His Val Asp Ala Gly Lys Thr
20

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(A) DESCRIPTION: CBP80

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Xaa Lys Xaa Xaa Trp Gln Xaa Lys Gln Tyr Leu Lys Glu Asp Gly Ser
1 5 10 15

Gln Ala Ala Asn Glu Xaa Val Phe Asp Thr Ala
20 25

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(A) DESCRIPTION: CBP78

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Gln Lys Ile Ile Gly Ile Asp Leu Gly Thr Thr Asn Ser Ala Val Ala
1 5 10 15

Val Leu Glu Gly Thr Glu Ser Lys Ile Ile Ala Asn Pro Glu
20 25 30

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide
(A) DESCRIPTION: CBP70

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Xaa Xaa Xaa Glu Val Ala Lys Xaa Ser Gln Asp Thr Thr Thr Ala Ser
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP60 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

Xaa Asn Glu Arg Val Lys Ile Val Ala Thr Leu Gly Pro Ala Val Glu
1 5 10 15

Gly Arg Gly

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 18 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: CBP50 pep

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

Xaa Ile Ile Xaa Xaa Val Tyr Ala Arg Glu Val Leu Asp Ser Arg Gly
1 5 10 15

Asn Pro



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: CBP112-Int1

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys
1 5 10

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: CBP112-Int2

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Xaa Asp Asp Gln Gln Ala Glu Glu Asp Tyr Ala
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Trp Gln Pro Pro Arg Ala Arg Ile
1 5



(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: 50 KDa protein

(v) FRAGMENT TYPE: N-terminal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

Ile Thr Asp Val Tyr Ala Arg Glu Val Leu Asp Ser Arg Gly Asn Pro
1 5 10 15

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide

(A) DESCRIPTION: 50 KDa protein

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(v) FRAGMENT TYPE: N-terminal

(vi) ORIGINAL SOURCE:

(A) ORGANISM: B. subtilis

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Ile Val Asp Val Arg Ala Arg Glu Val Leu Asp Ser Arg Gly Asn Pro
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

TCCCGCGGCA ACCCAACACT TGAAGTAGAA GTTTACACTG AATCAGGTGC TTTCGGACGT 60

GGTATGGTTC CATCAGGAGC TTCTACTGGT GAACACGAAG CAGTTGAACT TCGCGACGGT 120

GACAAATCTC GTTACGGTGG TCTTGGTACA CAAAAAGCTG TTGACAACGT AAACAACATC 180

ATTGCTGAGG CCATCATTGG CTACGATGTA 210



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: B. subtilis

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCCCGCGGCA ACCCAACAGT TGAAGTTGAA GTATATACAG AAACAGGAGC TTTCGGCCGC 60

GCATTAGTGC CAAGCGGAGC TTCTACAGGT CAATACGAAG CGGTTGAGCT TCGTGACGGC 120

GACAAAGACC GTTACCTTGG AAAAGGCGTG TTAACAGCTG TTAACAACGT AAACGAAATC 180 

ATTGCTCCAG AGCTTCTTGG CTTTGATGTA 210 

(2) INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

Ser Arg Gly Asn Pro Thr Leu Glu Val Glu Val Tyr Thr Glu Ser Gly
1 5 10 15

Ala Phe Gly Arg Gly Met Val Pro Ser Gly Ala Ser Thr Gly Glu His
20 25 30

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Ser Arg Tyr Gly Gly Leu
35 40 45

Gly Thr Gln Lys Ala Val Asp Asn Val Asn Asn Ile Ile Ala Glu Ala
50 55 60

Ile Ile Gly Tyr Asp Val
65 70 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: B. subtilis

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

Ser Arg Gly Asn Pro Thr Val Glu Val Glu Val Tyr Thr Glu Thr Gly
1 5 10 15

Ala Phe Gly Arg Ala Leu Val Pro Ser Gly Ala Ser Thr Gly Gln Tyr
20 25 30

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Asp Arg Tyr Leu Gly Lys
35 40 45

Gly Val Leu Thr Ala Val Asn Asn Val Asn Glu Ile Ile Ala Pro Glu
50 55 60 

Leu Leu Gly Phe Asp Val 
65 70 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1511 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

TCCCGCGGCA ACCCAACACT TGAAGTAGAA GTTTACACTG AATCAGGTGC TTTCGGACGT 60

GGTATGGTTC CATCAGGAGC TTCTACTGGT GAACACGAAG CAGTTGAACT TCGCGACGGT 120

GACAAATCTC GTTACGGTGG TCTTGGTACA CAAAAAGCTG TTGACAACGT AAACAACATC 180

ATTGCTGATT CTATCATTGG CTACGATGTA CGTGATCAAC AAGCTATTGA CCGTGCTATG 240

ATCGCACTTG ACGGTACTCC TAACAAAGGT AAATTGGGTG CGAATGCAAT CCTCGGTGTG 300

TCTATCGCTG TAGCTCGTGC TGCTGCTGAC TACCTTGAAA TCCCACTTTA CAGCTACCTT 360

GGTGGATTCA ACACTAAAGT TCTTCCAACT CCAATGATGA ACATCATCAA CGGTGGTTCT 420

CACTCTGACG CTCCAATCGC TTTCCAAGAG TTCATGATCT TGCCAGTTGG TGCGCCAACA 480

TTTAAAGAAA CCCTTCGTTA CGGTGCTGAA ATCTTCCACG CTCTTAAGAA AATCCTTAAA 540

TCACGTGGTT TGGAAACTGC CGTAGGTGAC GAAGGTGGAT TCGCTCCTCG TTTCGAAGGA 600

ACTGAAGATG GTGTTGAAAC TATCCTTGCT GCGATTGAAG CTGCTGGATA TGTACCAGGT 660

AAAGACGTAT TTCTCGGATT TGACTGTGCT TCATCAGAAT TCTACGATAA AGAACGTAAA 720

GTTTACGACT ACACTAAATT TGAAGGCGAA GGTGCTGCTG TTCGTACATC TGCAGAACAA 780

ATCGACTACC TTGAAGAATT GGTTAACAAA TACCCAATCA TCACTATTGA AGATGGTATG 840

GATGAAAACG ACTGGGATGG TTGGAAAGCT CTTACTGAAC GTCTTGGTAA GAAAGTACAA 900

CTTGTTGGTG ACGACTTCTT CGTAACAAAC ACTGACTACC TTGCACGTGG TATCCAAAAA 960

GGTGCTGCTA ACTCAATCCT TATCAAAGTT AACCAAATCG GTACTCTTAC TGAAACTTTT 1020

GAAGCTATCG AAATGGCTAA AGAAGCTGGT TACACTGCTG TTGTATCACA CCGTTCAGGT 1080

GAAACTGAAG ATTCAACAAT CGCTGATATC GCGATTGCAA CTAACGCAGG ACAAATCAAG 1140

ACTGGTTCAC TTTCACGTAC AGACCGTATC GCTAAATACA ACCAATTGCT TCGTATCGAA 1200

GACCAACTTG GTGAAGTAGC AGAATATCGT GGATTGAAAT CATTCTACGA CCTTAAAAAA 1260

TAAAATAGTA CAGTGAACTA TTTTATCCCC GAACCATGAA ATTCAAATTC CGGCCCTTGA 1320

TTTAACAACT TTCTAGCCCC CCGATTTATC CGAGGGGATA TTTTTGTTTT AAAGGGCAAA 1380

AAGGACAAAC TTTTTAAAAT CTTTTCATAC ATTACCATAC TTGATGCTCC CCTCCTTCTT 1440

GTACTGGTTT AAGAATATAC TCCATCGATG ACACCGTCGG ATGTTTAGGA TTATTCGACA 1500

ACAAGTTGAA T 1511



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

Ser Arg Gly Asn Pro Thr Leu Glu Val Glu Val Tyr Thr Glu Ser Gly
1 5 10 15

Ala Phe Gly Arg Gly Met Val Pro Ser Gly Ala Ser Thr Gly Glu His
20 25 30

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Ser Arg Tyr Gly Gly Leu
35 40 45

Gly Thr Gln Lys Ala Val Asp Asn Val Asn Asn Ile Ile Ala Asp Ser
50 55 60

Ile Ile Gly Tyr Asp Val Arg Asp Gln Gln Ala Ile Asp Arg Ala Met
65 70 75 80

Ile Ala Leu Asp Gly Thr Pro Asn Lys Gly Lys Leu Gly Ala Asn Ala
85 90 95

Ile Leu Gly Val Ser Ile Ala Val Ala Arg Ala Ala Ala Asp Tyr Leu
100 105 110

Glu Ile Pro Leu Tyr Ser Tyr Leu Gly Gly Phe Asn Thr Lys Val Leu
115 120 125

Pro Thr Pro Met Met Asn Ile Ile Asn Gly Gly Ser His Ser Asp Ala
130 135 140

Pro Ile Ala Phe Gln Glu Phe Met Ile Leu Pro Val Gly Ala Pro Thr
145 150 155 160

Phe Lys Glu Thr Leu Arg Tyr Gly Ala Glu Ile Phe His Ala Leu Lys
165 170 175

Lys Ile Leu Lys Ser Arg Gly Leu Glu Thr Ala Val Gly Asp Glu Gly
180 185 190

Gly Phe Ala Pro Arg Phe Glu Gly Thr Glu Asp Gly Val Glu Thr Ile
195 200 205

Leu Ala Ala Ile Glu Ala Ala Gly Tyr Val Pro Gly Lys Asp Val Phe
210 215 220

Leu Gly Phe Asp Cys Ala Ser Ser Glu Phe Tyr Asp Lys Glu Arg Lys
225 230 235 240

Val Tyr Asp Tyr Thr Lys Phe Glu Gly Glu Gly Ala Ala Val Arg Thr
245 250 255

Ser Ala Glu Gln Ile Asp Tyr Leu Glu Glu Leu Val Asn Lys Tyr Pro
260 265 270

Ile Ile Thr Ile Glu Asp Gly Met Asp Glu Asn Asp Trp Asp Gly Trp
275 280 285

Lys Ala Leu Thr Glu Arg Leu Gly Lys Lys Val Gln Leu Val Gly Asp
290 295 300

Asp Phe Phe Val Thr Asn Thr Asp Tyr Leu Ala Arg Gly Ile Gln Lys
305 310 315 320

Gly Ala Ala Asn Ser Ile Leu Ile Lys Val Asn Gln Ile Gly Thr Leu
325 330 335

Thr Glu Thr Phe Glu Ala Ile Glu Met Ala Lys Glu Ala Gly Tyr Thr
340 345 350

Ala Val Val Ser His Arg Ser Gly Glu Thr Glu Asp Ser Thr Ile Ala
355 360 365

Asp Ile Ala Ile Ala Thr Asn Ala Gly Gln Ile Lys Thr Gly Ser Leu
370 375 380

Ser Arg Thr Asp Arg Ile Ala Lys Tyr Asn Gln Leu Leu Arg Ile Glu
385 390 395 400

Asp Gln Leu Gly Glu Val Ala Glu Tyr Arg Gly Leu Lys Ser Phe Tyr
405 410 415

Asp Leu Lys Lys
420



(2) INFORMATION FOR SEQ ID NO: 20: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

AAAAAAGGTA GCAGAAGCTG AGAAGAAGGT TGAAGAAGCT GAGAAAAAAG CCAAGGATCA 60

AAAAGAAGAA GATCGCCGTA ACTACCCAAC CAATACTTAC AAAACGCTTG ACCTTGAAAT 120

TGCTGAGTCC GATGTGAAAG TTAAAGAAGC GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA 180

GGAACCTCGA GACGAGGAAA AAATTAAGCA AGCAAAAGCG AAAGTTGAGA GTAAAAAAGC 240

TGAGGCTACA AGGTTAGAAA ACATCAAGAC AGATCGTAAA AAAGCAGAAG AAGAAGCTAA 300

ACGAAAAGCA GCAGAAGAAG ATAAAGTTAA AGAAAAACCA GCTGAACAAC CACAACCAGC 360

GCCGGTTACT CAACCAGAAA AACCAGCTCC AAAACCAGAG AAGCCAGCTG AACAACCAAA 420

AGCAGAAAA 429

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Glu Lys Lys
1 5 10 15

Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr
20 25 30

Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Ser Asp Val Lys Val Lys
35 40 45

Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Pro Arg Asp
50 55 60

Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val Glu Ser Lys Lys Ala
65 70 75 80

Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys Lys Ala Glu
85 90 95

Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys
100 105 110

Pro Ala Glu Gln Pro Gln Pro Ala Pro Val Thr Gln Pro Glu Lys Pro
115 120 125

Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro Lys Ala Glu
130 135 140

(2) INFORMATION FOR SEQ ID NO: 22:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22:

GCTCTTNCTC GATGTCTCNG TNGCCAT 27

(2) INFORMATION FOR SEQ ID NO: 23:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

AGCATAGTCT TCTTCGACTT GTTGATCATC 30

(2) INFORMATION FOR SEQ ID NO: 24:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA
(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24:

CCAAGCAGCC ACTTCTTCTA ATATGGCAAA GACAGAACAT AGGAAAGCTG CTAAACAAGC 60

GTCGATGAAT ATATAGAAAA AATGATGAGG GAGATTCAAC TAGATAGAAG AAAACATACC 120

CAAAATGTCG CCTTAAACAT AAAGTTGAGC GCAATTAAAA CGAAGTATTT GCGTGAATTA 180

AATGTTTTAG AAGAGAAGTC GAAAGATGAG TTGCCGTCAG AAATAAAAGC AAAGTTAGAC 240

GCAGCTTTTG AGAAGTTTAA AAAAGATACA TTGAAACCAG GAGAAAAGGT AGCAGAAGCT 300

AAGAAGAAGG TTGAAGAAGC TAAGAAAAAA GCCGAGGATC AAAAAGAAGA AGATCGTCGT 360

AACTACCCAA CCAATACTTA CAAAACGCTT GAACTTGAAA TTGCTGAGTT CGATGTGAAA 420

GTTAAAGAAG CGGAGCTTGA ACTAGTAAAA GAGGAAGCTA AAGAATCTCG AAACGAGGGC 480

ACAATTAAGC AAGCAAAAGA GAAAGTTGAG AGTAAAAAAG CTGAGGCTAC AAGGTTAGAA 540

AACATCAAGA CAGATCGTAA AAAAGCAGAA GAAGAAGCTA AACGAAAAGC AGATGCTAAG 600

TTGAAGGAAG CTAATGTAGC GACTTCAGAT CAAGGTAAAC CAAAGGGGCG GGCAAAACGA 660

GGAGTTCCTG GAGAGCTAGC AACACCTGAT AAAAAAGAAA ATGATGCGAA GTCTTCAGAT 720

TCTAGCGTAG GTGAAGAAAC TCTTCCAAGC TCATCCCTGA AATCAGGAAA AAAGGTAGCA 780

GAAGCTGAGA AGAAGGTTGA AGAAGCTGAG AAAAAAGCCA AGGATCAAAA AGAAGAAGAT 840

CGCCGTAACT ACCCAACCAA TACTTACAAA ACGCTTGACC TTGAAATTGC TGAGTCCGAT 900

GTGAAAGTTA AAGAAGCGGA GCTTGAACTA GTAAAAGAGG AAGCTAAGGA ACCTCGAGAC 960

GAGGAAAAAA TTAAGCAAGC AAAAGCGAAA GTTGAGAGTA AAAAAGCTGA GGCTACAAGG 1020

TTAGAAAACA TCAAGACAGA TCGTAAAAAA GCAGAAGAAG AAGCTAAACG AAAAGCAGCA 1080

GAAGAAGATA AAGTTAAAGA AAAACCAGCT GAACAACCAC AACCAGCGCC GGCTACTCAA 1140

CCAGAAAAAC CAGCTCCAAA ACCAGAGAAG CCAGCTGAAC AACCAAAAGC AGAAAAAACA 1200

GATGATCAAC AAGCTGAAGA AGACTATGCT CGTAGATCAG AAGAAGAATA TAATCGCTTG 1260

ACTCAACAGC AACCGCCAAA AACTGAAAAA CCAGCACAAC CATCTACTCC AAAAACAGGC 1320

TGGAAACAAG AAAACGGTAT GTGGTACTTC TACAATACTG ATGGTTCAAT GGCAACAGGA 1380

TGGCTCCAAA ACAACGGTTC ATGGTACTAT CTAAACGCTA ATGGTGCTAT GGCGACAGGA 1440

TGGCTCCAAA ACAATGGTTC ATGGTACTAT CTAAACGCTA ATGGTTCAAT GGCAACAGGA 1500

TGGCTCCAAA ACAATGGTTC ATGGTACTAC CTAAACGCTA ATGGTGCTAT GGCGACAGGA 1560

TGGCTCCAAT ACAATGGTTC ATGGTACTAC CTAAACAGCA ATGGCGCTAT GGCGACAGGA 1620

TGGCTCCAAT ACAATGGCTC ATGGTACTAC CTCAACGCTA ATGGTGATAT GGCGACAGGA 1680

TGGCTCCAAA ACAACGGTTC ATGGTACTAC CTCAACGCTA ATGGTGATAT GGCGACAGGA 1740

TGGCTCCAAT ACAACGGTTC ATGGTATTAC CTCAACGCTA ATGGTGATAT GGCGACAGGT 1800

TGGGTGAAAG ATGGAGATAC CTGGTACTAT CTTGAAGCAT CAGGTGCTAT GAAAGCAAGC 1860

CAATGGTTCA AAGTATCAGA TAAATGGTAC TATGTCAATG GCTCAGGTGC CCTTGCAGTC 1920

AACACAACTG TAGATGGCTA TGGAGTCAAT GCCAATGGTG AATGGGTAAA CTAAACCTAA 1980

TATAACTAGT TAATACTGAC TTCCTGTAAG AACTTTTTAA AGTATTCCCT ACAAATACCA 2040

TATCCTTTCA GTAGATAATA TACCCTTGTA GGAAGTTTAG ATTAAAAAAT AACTCTGTAA 2100

TCTCTAGCCG GATTTATAGC GCTAGAGACT ACGGAGTTTT TTTGATGAGG AAAGAATGGC 2160

GGCATTCAAG AGACTCTTTA AGAGAGTTAC GGGTTTTAAA CTATTAAGCC TTCTCCAATT 2220

GCAAGAGGCT TCAATCTCTG CTAGGGTGCT AGCTTGCGAA ATGGCTCCAC GGAGTTTGGC 2280

AGCGCCAGAT GTTCCACGGA GATAGTGAGG AGCGAGGCCG CGGAATTCAC GAACTGCGAC 2340

GTTTTCTCCT TTGAGGTTAA TCAATCGTTT CAAGTGTTCG TAGGCGATCT TCATCTTGTC 2400

TTCAAAGGTC AAATCAGGTA GGATTTCTCC TGTTTCAAAG TTTATGGTGG CCCTGGTTGA 2460

AG 2462

(2) INFORMATION FOR SEQ ID NO: 25:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: N-terminal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

Met Met Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn Val
1 5 10 15

Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg Glu
20 25 30

Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu Ile
35 40 45

Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr Leu
50 55 60

Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu Ala
65 70 75 80

Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
85 90 95

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp Val
100 105 110

Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
115 120 125

Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu Ser
130 135 140

Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys
145 150 155 160

Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys Glu
165 170 175

Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala Lys
180 185 190

Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp
195 200 205

Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Ser
210 215 220

Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu
225 230 235 240

Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn
245 250 255

Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Ser
260 265 270

Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala
275 280 285

Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val
290 295 300

Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asp Ile Lys Thr Asp
305 310 315 320

Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp
325 330 335

Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Thr
340 345 350

Gln Pro Glu Lys Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro
355 360 365

Lys Ala Glu Lys Thr Asp Asp Gln Gln Ala Glu Glu Asp Tyr Ala Arg
370 375 380

Arg Ser Glu Glu Glu Tyr Asn Arg Leu Thr Gln Gln Gln Pro Pro Lys
385 390 395 400

Thr Glu Lys Pro Ala Gln Pro Ser Thr Pro Lys Thr Gly Trp Lys Gln
405 410 415

Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala Thr
420 425 430

Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly
435 440 445

Ala Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu
450 455 460

Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser
465 470 475 480

Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Leu Gln
485 490 495

Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr
500 505 510

Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly
515 520 525

Asp Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu
530 535 540

Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Ser
545 550 555 560

Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val Lys
565 570 575

Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala
580 585 590

Ser Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Ser
595 600 605

Gly Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn Ala
610 615 620

Asn Gly Glu Trp Val Asn Glx
625 630
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Masure, H. Robert 

Rosenow, Carsten I. 
Tuomanen , E 1 a ine 
Wizemann, Theresa M. 



(ii) TITLE OF INVENTION: CHOLINE BINDING PROTEINS FOR 
ANT I - PNEUMOCOCCAL VACCINES 

(iii) NUMBER OF SEQUENCES: 25 



jCy <iv) CORRESPONDENCE ADDRESS: 

3*1 1 

"~ (A) ADDRESSEE: David A. Jackson, Esq. 

id (B) STREET: 411 Hackensack Ave, Continental Plaza, 4th 
"=n Floor 

» (C) CITY: Hackensack 

v ^ (D) STATE: New Jersey 

Q ( E ) COUNTRY: USA 

(F) ZIP : 07601 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS /MS -DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.3 0 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Jackson Esq., David A. 
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(B) REGISTRATION NUMBER: 26,742 

(C) REFERENCE /DOCKET NUMBER: 600-1-158 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 201-487-5800 

(B) TELEFAX: 201-343-1684 



(2) INFORMATION FOR SEQ ID NO : 1 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: CBP112 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Xaa Glu Asn Glu Gly Ser Thr Gin Ala Ala Thr Ser Ser Asn Met Ala 
1 5 10 15 

Lys Thr Glu His Arg Lys Ala Ala Lys Gin Val Val Asp Glu 
20 25 30 



(2) INFORMATION FOR SEQ ID NO : 2 : 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 
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( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP90 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Ala Arg Glu Phe Ser Leu Glu Lys Thr Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP84 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

Xaa Arg Glu Phe Ser Leu Glu Lys Thr Arg Asn lie Gly lie Met Ala 
1 5 10 15 



:J 
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His Val Asp Ala Gly Lys Thr 
20 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP80 

(iii) HYPOTHETICAL : NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Xaa Lys Xaa Xaa Trp Gin Xaa Lys Gin Tyr Leu Lys Glu Asp Gly Ser 
15 10 15 

Gin Ala Ala Asn Glu Xaa Val Phe Asp Thr Ala 
20 25 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP78 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Gin Lys He He Gly He Asp Leu Gly Thr Thr Asn Ser Ala Val Ala 
1 5 10 15 

Val Leu Glu Gly Thr Glu Ser Lys He He Ala Asn Pro Glu 
20 25 30 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP70 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 



Xaa Xaa Xaa Glu Val Ala Lys Xaa Ser Gin Asp Thr Thr Thr Ala Ser 
15 10 15 
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(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(A) DESCRIPTION: CBP6 0 

(iii) HYPOTHETICAL: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 



Xaa Asn Glu Arg Val Lys He Val Ala Thr Leu Gly Pro Ala Val Glu 



Gly Arg Gly 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 
<B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: CBP5 0 pep 



(iv) 



ANTI- SENSE: NO 



1 



5 



10 



15 



(iii) 



HYPOTHETICAL: NO 



# 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Xaa He He Xaa Xaa Val Tyr Ala Arg Glu Val Leu Asp Ser Arg Gly 
! 5 10 15 

Asn Pro 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 
40 (B) TYPE: amino acid 

'l^ (C) STRANDEDNESS : 

tU (D) TOPOLOGY: linear 

■s (ii) MOLECULE TYPE: peptide 

*~ (A) DESCRIPTION: cBP112-Intl 



(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr Tyr Lys 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 
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(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: cBP112-Int2 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 

Xaa Asp Asp Gin Gin Ala Glu Glu Asp Tyr Ala 
15 10 



) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
Trp Gin Pro Pro Arg Ala Arg lie 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(A) DESCRIPTION: 50 KDa protein 

( v) FRAGMENT TYPE: N- terminal 

..fsj (vi) ORIGINAL SOURCE: 

vD (A) ORGANISM: S. pneumoniae 

HYPOTHET I CAL : NO 
ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 12 : 

lie Thr Asp Val Tyr Ala Arg Glu Val Leu Asp Ser Arg Gly Asn Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(iii) 
(iv) 



(ii) 



MOLECULE TYPE : peptide 

(A) DESCRIPTION: 50 KDa protein 
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(iii) HYPOTHETICAL.: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

lie Val Asp Val Arg Ala Arg Glu Val Leu Asp Ser Arg Gly Asn Pro 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 14: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: S. pneumoniae 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

TCCCGCGGCA ACCCAACACT TGAAGTAGAA GTTTACACTG AATCAGGTGC TTTCGGACGT
GGTATGGTTC CATCAGGAGC TTCTACTGGT GAACAGGAAG CAGTTGAACT TCGCGACGGT
GACAAATCTC GTTACGGTGG TCTTGGTACA CAAAAAGCTG TTGACAACGT AAACAACATC
ATTGCTGAGG CCATCATTGG CTACGATGTA

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 210 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
TCCCGCGGCA ACCCAACAGT TGAAGTTGAA GTATATACAG AAACAGGAGC TTTCGGCCGC 
GCATTAGTGC CAAGCGGAGC TTCTACAGGT CAATACGAAG CGGTTGAGCT TCGTGACGGC 
GACAAAGACC GTTACCTTGG AAAAGGCGTG TTAACAGCTG TTAACAACGT AAACGAAATC 
ATTGCTCCAG AGCTTCTTGG CTTTGATGTA 



(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pneumoniae 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Ser Arg Gly Asn Pro Thr Leu Glu Val Glu Val Tyr Thr Glu Ser Gly
1 5 10 15

Ala Phe Gly Arg Gly Met Val Pro Ser Gly Ala Ser Thr Gly Glu His
20 25 30

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Ser Arg Tyr Gly Gly Leu
35 40 45

Gly Thr Gln Lys Ala Val Asp Asn Val Asn Asn Ile Ile Ala Glu Ala
50 55 60

Ile Ile Gly Tyr Asp Val
65 70 



(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 70 amino acids

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Ser Arg Gly Asn Pro Thr Val Glu Val Glu Val Tyr Thr Glu Thr Gly
1 5 10 15

Ala Phe Gly Arg Ala Leu Val Pro Ser Gly Ala Ser Thr Gly Gln Tyr
20 25 30

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Asp Arg Tyr Leu Gly Lys
35 40 45

Gly Val Leu Thr Ala Val Asn Asn Val Asn Glu Ile Ile Ala Pro Glu
50 55 60

Leu Leu Gly Phe Asp Val
65 70



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1511 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

TCCCGCGGCA ACCCAACACT TGAAGTAGAA GTTTACACTG AATCAGGTGC TTTCGGACGT 60

GGTATGGTTC CATCAGGAGC TTCTACTGGT GAACACGAAG CAGTTGAACT TCGCGACGGT 120

GACAAATCTC GTTACGGTGG TCTTGGTACA CAAAAAGCTG TTGACAACGT AAACAACATC 180

ATTGCTGATT CTATCATTGG CTACGATGTA CGTGATCAAC AAGCTATTGA CCGTGCTATG 240

ATCGCACTTG ACGGTACTCC TAACAAAGGT AAATTGGGTG CGAATGCAAT CCTCGGTGTG 300

TCTATCGCTG TAGCTCGTGC TGCTGCTGAC TACCTTGAAA TCCCACTTTA CAGCTACCTT 360

GGTGGATTCA ACACTAAAGT TCTTCCAACT CCAATGATGA ACATCATCAA CGGTGGTTCT 420

CACTCTGACG CTCCAATCGC TTTCCAAGAG TTCATGATCT TGCCAGTTGG TGCGCCAACA 480

TTTAAAGAAA CCCTTCGTTA CGGTGCTGAA ATCTTCCACG CTCTTAAGAA AATCCTTAAA 540

TCACGTGGTT TGGAAACTGC CGTAGGTGAC GAAGGTGGAT TCGCTCCTCG TTTCGAAGGA 600

ACTGAAGATG GTGTTGAAAC TATCCTTGCT GCGATTGAAG CTGCTGGATA TGTACCAGGT 660

AAAGACGTAT TTCTCGGATT TGACTGTGCT TCATCAGAAT TCTACGATAA AGAACGTAAA 720

GTTTACGACT ACACTAAATT TGAAGGCGAA GGTGCTGCTG TTCGTACATC TGCAGAACAA 780

ATCGACTACC TTGAAGAATT GGTTAACAAA TACCCAATCA TCACTATTGA AGATGGTATG 840

GATGAAAACG ACTGGGATGG TTGGAAAGCT CTTACTGAAC GTCTTGGTAA GAAAGTACAA 900

CTTGTTGGTG ACGACTTCTT CGTAACAAAC ACTGACTACC TTGCACGTGG TATCCAAAAA 960

GGTGCTGCTA ACTCAATCCT TATCAAAGTT AACCAAATCG GTACTCTTAC TGAAACTTTT 1020

GAAGCTATCG AAATGGCTAA AGAAGCTGGT TACACTGCTG TTGTATCACA CCGTTCAGGT 1080

GAAACTGAAG ATTCAACAAT CGCTGATATC GCGATTGCAA CTAACGCAGG ACAAATCAAG 1140

ACTGGTTCAC TTTCACGTAC AGACCGTATC GCTAAATACA ACCAATTGCT TCGTATCGAA 1200

GACCAACTTG GTGAAGTAGC AGAATATCGT GGATTGAAAT CATTCTACGA CCTTAAAAAA 1260

TAAAATAGTA CAGTGAACTA TTTTATCCCC GAACCATGAA ATTCAAATTC CGGCCCTTGA 1320

TTTAACAACT TTCTAGCCCC CCGATTTATC CGAGGGGATA TTTTTGTTTT AAAGGGCAAA 1380

AAGGACAAAC TTTTTAAAAT CTTTTCATAC ATTACCATAC TTGATGCTCC CCTCCTTCTT 1440

GTACTGGTTT AAGAATATAC TCCATCGATG ACACCGTCGG ATGTTTAGGA TTATTCGACA 1500

ACAAGTTGAA T 1511

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 420 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: B. subtilis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Ser Arg Gly Asn Pro Thr Leu Glu Val Glu Val Tyr Thr Glu Ser Gly 
1 5 10 15 

Ala Phe Gly Arg Gly Met Val Pro Ser Gly Ala Ser Thr Gly Glu His 
20 25 30 

Glu Ala Val Glu Leu Arg Asp Gly Asp Lys Ser Arg Tyr Gly Gly Leu 
35 40 45 

Gly Thr Gln Lys Ala Val Asp Asn Val Asn Asn Ile Ile Ala Asp Ser
50 55 60

Ile Ile Gly Tyr Asp Val Arg Asp Gln Gln Ala Ile Asp Arg Ala Met
65 70 75 80

Ile Ala Leu Asp Gly Thr Pro Asn Lys Gly Lys Leu Gly Ala Asn Ala
85 90 95

Ile Leu Gly Val Ser Ile Ala Val Ala Arg Ala Ala Ala Asp Tyr Leu
100 105 110

Glu Ile Pro Leu Tyr Ser Tyr Leu Gly Gly Phe Asn Thr Lys Val Leu
115 120 125

Pro Thr Pro Met Met Asn Ile Ile Asn Gly Gly Ser His Ser Asp Ala
130 135 140

Pro Ile Ala Phe Gln Glu Phe Met Ile Leu Pro Val Gly Ala Pro Thr
145 150 155 160

Phe Lys Glu Thr Leu Arg Tyr Gly Ala Glu Ile Phe His Ala Leu Lys
165 170 175

Lys Ile Leu Lys Ser Arg Gly Leu Glu Thr Ala Val Gly Asp Glu Gly
180 185 190

Gly Phe Ala Pro Arg Phe Glu Gly Thr Glu Asp Gly Val Glu Thr Ile
195 200 205

Leu Ala Ala Ile Glu Ala Ala Gly Tyr Val Pro Gly Lys Asp Val Phe
210 215 220

Leu Gly Phe Asp Cys Ala Ser Ser Glu Phe Tyr Asp Lys Glu Arg Lys
225 230 235 240

Val Tyr Asp Tyr Thr Lys Phe Glu Gly Glu Gly Ala Ala Val Arg Thr
245 250 255

Ser Ala Glu Gln Ile Asp Tyr Leu Glu Glu Leu Val Asn Lys Tyr Pro
260 265 270

Ile Ile Thr Ile Glu Asp Gly Met Asp Glu Asn Asp Trp Asp Gly Trp
275 280 285

Lys Ala Leu Thr Glu Arg Leu Gly Lys Lys Val Gln Leu Val Gly Asp
290 295 300

Asp Phe Phe Val Thr Asn Thr Asp Tyr Leu Ala Arg Gly Ile Gln Lys
305 310 315 320

Gly Ala Ala Asn Ser Ile Leu Ile Lys Val Asn Gln Ile Gly Thr Leu
325 330 335

Thr Glu Thr Phe Glu Ala Ile Glu Met Ala Lys Glu Ala Gly Tyr Thr
340 345 350

Ala Val Val Ser His Arg Ser Gly Glu Thr Glu Asp Ser Thr Ile Ala
355 360 365

Asp Ile Ala Ile Ala Thr Asn Ala Gly Gln Ile Lys Thr Gly Ser Leu
370 375 380

Ser Arg Thr Asp Arg Ile Ala Lys Tyr Asn Gln Leu Leu Arg Ile Glu
385 390 395 400

Asp Gln Leu Gly Glu Val Ala Glu Tyr Arg Gly Leu Lys Ser Phe Tyr
405 410 415

Asp Leu Lys Lys
420 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: S. pneumoniae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
AAAAAAGGTA GCAGAAGCTG AGAAGAAGGT TGAAGAAGCT GAGAAAAAAG CCAAGGATCA 
AAAAGAAGAA GATCGCCGTA ACTACCCAAC CAATACTTAC AAAACGCTTG ACCTTGAAAT
TGCTGAGTCC GATGTGAAAG TTAAAGAAGC GGAGCTTGAA CTAGTAAAAG AGGAAGCTAA 
GGAACCTCGA GACGAGGAAA AAATTAAGCA AGCAAAAGCG AAAGTTGAGA GTAAAAAAGC 
TGAGGCTACA AGGTTAGAAA ACATCAAGAC AGATCGTAAA AAAGCAGAAG AAGAAGCTAA
ACGAAAAGCA GCAGAAGAAG ATAAAGTTAA AGAAAAACCA GCTGAACAAC CACAACCAGC 
GCCGGTTACT CAACCAGAAA AACCAGCTCC AAAACCAGAG AAGCCAGCTG AACAACCAAA
AGCAGAAAA 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS:

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: S. pneumoniae 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu Glu Ala Glu Lys Lys 
1 5 10 15

Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro Thr Asn Thr
20 25 30

Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Ser Asp Val Lys Val Lys
35 40 45

Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu Pro Arg Asp
50 55 60

Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val Glu Ser Lys Lys Ala
65 70 75 80

Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys Lys Ala Glu
85 90 95

Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp Lys Val Lys Glu Lys
100 105 110

Pro Ala Glu Gln Pro Gln Pro Ala Pro Val Thr Gln Pro Glu Lys Pro
115 120 125

Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro Lys Ala Glu
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide primer" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GCTCTTNCTC GATGTCTCNG TNGCCAT 27 



(2) INFORMATION FOR SEQ ID NO: 23: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: other nucleic acid

(A) DESCRIPTION: /desc = "oligonucleotide primer"

(iii) HYPOTHETICAL: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23:

AGCATAGTCT TCTTCGACTT GTTGATCATC 30



(2) INFORMATION FOR SEQ ID NO: 24: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CCAAGCAGCC ACTTCTTCTA ATATGGCAAA GACAGAACAT AGGAAAGCTG CTAAACAAGC 60

GTCGATGAAT ATATAGAAAA AATGATGAGG GAGATTCAAC TAGATAGAAG AAAACATACC 120

CAAAATGTCG CCTTAAACAT AAAGTTGAGC GCAATTAAAA CGAAGTATTT GCGTGAATTA 180

AATGTTTTAG AAGAGAAGTC GAAAGATGAG TTGCCGTCAG AAATAAAAGC AAAGTTAGAC 240

GCAGCTTTTG AGAAGTTTAA AAAAGATACA TTGAAACCAG GAGAAAAGGT AGCAGAAGCT 300

AAGAAGAAGG TTGAAGAAGC TAAGAAAAAA GCCGAGGATC AAAAAGAAGA AGATCGTCGT 360

AACTACCCAA CCAATACTTA CAAAACGCTT GAACTTGAAA TTGCTGAGTT CGATGTGAAA 420

GTTAAAGAAG CGGAGCTTGA ACTAGTAAAA GAGGAAGCTA AAGAATCTCG AAACGAGGGC 480

ACAATTAAGC AAGCAAAAGA GAAAGTTGAG AGTAAAAAAG CTGAGGCTAC AAGGTTAGAA 540

AACATCAAGA CAGATCGTAA AAAAGCAGAA GAAGAAGCTA AACGAAAAGC AGATGCTAAG 600

TTGAAGGAAG CTAATGTAGC GACTTCAGAT CAAGGTAAAC CAAAGGGGCG GGCAAAACGA 660

GGAGTTCCTG GAGAGCTAGC AACACCTGAT AAAAAAGAAA ATGATGCGAA GTCTTCAGAT 720

TCTAGCGTAG GTGAAGAAAC TCTTCCAAGC TCATCCCTGA AATCAGGAAA AAAGGTAGCA 780

GAAGCTGAGA AGAAGGTTGA AGAAGCTGAG AAAAAAGCCA AGGATCAAAA AGAAGAAGAT 840

CGCCGTAACT ACCCAACCAA TACTTACAAA ACGCTTGACC TTGAAATTGC TGAGTCCGAT 900

GTGAAAGTTA AAGAAGCGGA GCTTGAACTA GTAAAAGAGG AAGCTAAGGA ACCTCGAGAC 960

GAGGAAAAAA TTAAGCAAGC AAAAGCGAAA GTTGAGAGTA AAAAAGCTGA GGCTACAAGG 1020

TTAGAAAACA TCAAGACAGA TCGTAAAAAA GCAGAAGAAG AAGCTAAACG AAAAGCAGCA 1080

GAAGAAGATA AAGTTAAAGA AAAACCAGCT GAACAACCAC AACCAGCGCC GGCTACTCAA 1140

CCAGAAAAAC CAGCTCCAAA ACCAGAGAAG CCAGCTGAAC AACCAAAAGC AGAAAAAACA 1200

GATGATCAAC AAGCTGAAGA AGACTATGCT CGTAGATCAG AAGAAGAATA TAATCGCTTG 1260

ACTCAACAGC AACCGCCAAA AACTGAAAAA CCAGCACAAC CATCTACTCC AAAAACAGGC 1320

TGGAAACAAG AAAACGGTAT GTGGTACTTC TACAATACTG ATGGTTCAAT GGCAACAGGA 1380

TGGCTCCAAA ACAACGGTTC ATGGTACTAT CTAAACGCTA ATGGTGCTAT GGCGACAGGA 1440

TGGCTCCAAA ACAATGGTTC ATGGTACTAT CTAAACGCTA ATGGTTCAAT GGCAACAGGA 1500

TGGCTCCAAA ACAATGGTTC ATGGTACTAC CTAAACGCTA ATGGTGCTAT GGCGACAGGA 1560

TGGCTCCAAT ACAATGGTTC ATGGTACTAC CTAAACAGCA ATGGCGCTAT GGCGACAGGA 1620

TGGCTCCAAT ACAATGGCTC ATGGTACTAC CTCAACGCTA ATGGTGATAT GGCGACAGGA 1680

TGGCTCCAAA ACAACGGTTC ATGGTACTAC CTCAACGCTA ATGGTGATAT GGCGACAGGA 1740

TGGCTCCAAT ACAACGGTTC ATGGTATTAC CTCAACGCTA ATGGTGATAT GGCGACAGGT 1800

TGGGTGAAAG ATGGAGATAC CTGGTACTAT CTTGAAGCAT CAGGTGCTAT GAAAGCAAGC 1860

CAATGGTTCA AAGTATCAGA TAAATGGTAC TATGTCAATG GCTCAGGTGC CCTTGCAGTC 1920

AACACAACTG TAGATGGCTA TGGAGTCAAT GCCAATGGTG AATGGGTAAA CTAAACCTAA 1980

TATAACTAGT TAATACTGAC TTCCTGTAAG AACTTTTTAA AGTATTCCCT ACAAATACCA 2040

TATCCTTTCA GTAGATAATA TACCCTTGTA GGAAGTTTAG ATTAAAAAAT AACTCTGTAA 2100

TCTCTAGCCG GATTTATAGC GCTAGAGACT ACGGAGTTTT TTTGATGAGG AAAGAATGGC 2160

GGCATTCAAG AGACTCTTTA AGAGAGTTAC GGGTTTTAAA CTATTAAGCC TTCTCCAATT 2220

GCAAGAGGCT TCAATCTCTG CTAGGGTGCT AGCTTGCGAA ATGGCTCCAC GGAGTTTGGC 2280

AGCGCCAGAT GTTCCACGGA GATAGTGAGG AGCGAGGCCG CGGAATTCAC GAACTGCGAC 2340

GTTTTCTCCT TTGAGGTTAA TCAATCGTTT CAAGTGTTCG TAGGCGATCT TCATCTTGTC 2400

TTCAAAGGTC AAATCAGGTA GGATTTCTCC TGTTTCAAAG TTTATGGTGG CCCTGGTTGA 2460

AG 2462

(2) INFORMATION FOR SEQ ID NO: 25: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 631 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: N-terminal

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

Met Met Arg Glu Ile Gln Leu Asp Arg Arg Lys His Thr Gln Asn Val
1 5 10 15

Ala Leu Asn Ile Lys Leu Ser Ala Ile Lys Thr Lys Tyr Leu Arg Glu
20 25 30

Leu Asn Val Leu Glu Glu Lys Ser Lys Asp Glu Leu Pro Ser Glu Ile
35 40 45

Lys Ala Lys Leu Asp Ala Ala Phe Glu Lys Phe Lys Lys Asp Thr Leu
50 55 60

Lys Pro Gly Glu Lys Val Ala Glu Ala Lys Lys Lys Val Glu Glu Ala
65 70 75 80

Lys Lys Lys Ala Glu Asp Gln Lys Glu Glu Asp Arg Arg Asn Tyr Pro
85 90 95

Thr Asn Thr Tyr Lys Thr Leu Glu Leu Glu Ile Ala Glu Phe Asp Val
100 105 110

Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala Lys Glu
115 120 125

Ser Arg Asn Glu Gly Thr Ile Lys Gln Ala Lys Glu Lys Val Glu Ser
130 135 140

Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp Arg Lys
145 150 155 160

Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Asp Ala Lys Leu Lys Glu
165 170 175

Ala Asn Val Ala Thr Ser Asp Gln Gly Lys Pro Lys Gly Arg Ala Lys
180 185 190

Arg Gly Val Pro Gly Glu Leu Ala Thr Pro Asp Lys Lys Glu Asn Asp
195 200 205

Ala Lys Ser Ser Asp Ser Ser Val Gly Glu Glu Thr Leu Pro Ser Ser
210 215 220

Ser Leu Lys Ser Gly Lys Lys Val Ala Glu Ala Glu Lys Lys Val Glu
225 230 235 240

Glu Ala Glu Lys Lys Ala Lys Asp Gln Lys Glu Glu Asp Arg Arg Asn
245 250 255

Tyr Pro Thr Asn Thr Tyr Lys Thr Leu Asp Leu Glu Ile Ala Glu Ser
260 265 270

Asp Val Lys Val Lys Glu Ala Glu Leu Glu Leu Val Lys Glu Glu Ala
275 280 285

Lys Glu Pro Arg Asp Glu Glu Lys Ile Lys Gln Ala Lys Ala Lys Val
290 295 300

Glu Ser Lys Lys Ala Glu Ala Thr Arg Leu Glu Asn Ile Lys Thr Asp
305 310 315 320

Arg Lys Lys Ala Glu Glu Glu Ala Lys Arg Lys Ala Ala Glu Glu Asp
325 330 335

Lys Val Lys Glu Lys Pro Ala Glu Gln Pro Gln Pro Ala Pro Ala Thr
340 345 350

Gln Pro Glu Lys Pro Ala Pro Lys Pro Glu Lys Pro Ala Glu Gln Pro
355 360 365

Lys Ala Glu Lys Thr Asp Asp Gln Gln Ala Glu Glu Asp Tyr Ala Arg
370 375 380

Arg Ser Glu Glu Glu Tyr Asn Arg Leu Thr Gln Gln Gln Pro Pro Lys
385 390 395 400

Thr Glu Lys Pro Ala Gln Pro Ser Thr Pro Lys Thr Gly Trp Lys Gln
405 410 415

Glu Asn Gly Met Trp Tyr Phe Tyr Asn Thr Asp Gly Ser Met Ala Thr
420 425 430

Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly
435 440 445

Ala Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu
450 455 460

Asn Ala Asn Gly Ser Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser
465 470 475 480

Trp Tyr Tyr Leu Asn Ala Asn Gly Ala Met Ala Thr Gly Trp Leu Gln
485 490 495

Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ser Asn Gly Ala Met Ala Thr
500 505 510

Gly Trp Leu Gln Tyr Asn Gly Ser Trp Tyr Tyr Leu Asn Ala Asn Gly
515 520 525

Asp Met Ala Thr Gly Trp Leu Gln Asn Asn Gly Ser Trp Tyr Tyr Leu
530 535 540

Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Leu Gln Tyr Asn Gly Ser
545 550 555 560

Trp Tyr Tyr Leu Asn Ala Asn Gly Asp Met Ala Thr Gly Trp Val Lys
565 570 575

Asp Gly Asp Thr Trp Tyr Tyr Leu Glu Ala Ser Gly Ala Met Lys Ala
580 585 590

Ser Gln Trp Phe Lys Val Ser Asp Lys Trp Tyr Tyr Val Asn Gly Ser
595 600 605

Gly Ala Leu Ala Val Asn Thr Thr Val Asp Gly Tyr Gly Val Asn Ala
610 615 620

Asn Gly Glu Trp Val Asn Glx
625 630



